SEQUENCE LISTING 



(1) GENERAL INFORMATION:

(i) APPLICANTS: Morrow, Casey D. and Porter, Donna C.

(ii) TITLE OF INVENTION: ENCAPSIDATED RECOMBINANT POLIOVIRUS
NUCLEIC ACID AND METHODS OF MAKING AND
USING SAME

(iii) NUMBER OF SEQUENCES: 23 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD 

(B) STREET: 60 STATE STREET, SUITE 510 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: ASCII 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 0 000 0 0
(B) FILING DATE: 15-FEB-1995
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/087,009 

(B) FILING DATE: 01-JUL-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Silveri, Jean M.

(B) REGISTRATION NUMBER: P-39,030

(C) REFERENCE/DOCKET NUMBER: UAG-004CP

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (617) 227-5941 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

TATTAGTAGA TCTG 14

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

TACAGATGTA CTAA 14

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 20..845

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

ACACAGCAAT CAGGTCAGC CAA AAT TAC CCT ATA GTG CAG AAC ATC CAG GGG 52
Gln Asn Tyr Pro Ile Val Gln Asn Ile Gln Gly
1 5 10

CAA ATG GTA CAT CAG GCC ATA TCA CCT AGA ACT TTA AAT GCA TGG GTA 100
Gln Met Val His Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val
15 20 25

AAA GTA GTA GAA GAG AAG GCT TTC AGC CCA GAA GTG ATA CCC ATG TTT 148
Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe
30 35 40

TCA GCA TTA TCA GAA GGA GCC ACC CCA CAA GAT TTA AAC ACC ATG CTA 196
Ser Ala Leu Ser Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu
45 50 55

AAC ACA GTG GGG GGA CAT CAA GCA GCC ATG CAA ATG TTA AAA GAG ACC 244
Asn Thr Val Gly Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr
60 65 70 75

ATC AAT GAG GAA GCT GCA GAA TGG GAT AGA GTG CAT CCA GTG CAT GCA 292
Ile Asn Glu Glu Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala
80 85 90

GGG CCT ATT GCA CCA GGC CAG ATG AGA GAA CCA AGG GGA AGT GAC ATA 340
Gly Pro Ile Ala Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile
95 100 105

GCA GGA ACT ACT AGT ACC CTT CAG GAA CAA ATA GGA TGG ATG ACA AAT 388
Ala Gly Thr Thr Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr Asn
110 115 120

AAT CCA CCT ATC CCA GTA GGA GAA ATT TAT AAA AGA TGG ATA ATC CTG 436
Asn Pro Pro Ile Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu
125 130 135

GGA TTA AAT AAA ATA GTA AGA ATG TAT AGC CCT ACC AGC ATT CTG GAC 484
Gly Leu Asn Lys Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp
140 145 150 155

ATA AGA CAA GGA CCA AAG GAA CCC TTT AGA GAC TAT GTA GAC CGG TTC 532
Ile Arg Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe
160 165 170

TAT AAA ACT CTA AGA GCC GAG CAA GCT TCA CAG GAG GTA AAA AAT TGG 580
Tyr Lys Thr Leu Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp
175 180 185

ATG ACA GAA ACC TTG TTG GTC CAA AAT GCG AAC CCA GAT TGT AAG ACT 628
Met Thr Glu Thr Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr
190 195 200

ATT TTA AAA GCA TTG GGA CCA GCG GCT ACA CTA GAA GAA ATG ATG ACA 676
Ile Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr
205 210 215

GCA TGT CAG GGA GTA GGA GGA CCC GGC CAT AAG GCA AGA GTT TTG GCT 724
Ala Cys Gln Gly Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala
220 225 230 235

GAA GCA ATG AGC CAA GTA ACA AAT TCA GCT ACC ATA ATG ATG CAG AGA 772
Glu Ala Met Ser Gln Val Thr Asn Ser Ala Thr Ile Met Met Gln Arg
240 245 250

GGC AAT TTT AGG AAC CAA AGA AAG ATT GTT AAG TGT TTC AAT TGT GGC 820
Gly Asn Phe Arg Asn Gln Arg Lys Ile Val Lys Cys Phe Asn Cys Gly
255 260 265

AAA GAA GGG CAC ACA GCC AGA AAG T 846
Lys Glu Gly His Thr Ala Arg Lys
270 275

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 275 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Gln Asn Tyr Pro Ile Val Gln Asn Ile Gln Gly Gln Met Val His Gln
1 5 10 15

Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu
20 25 30

Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser Glu
35 40 45

Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly
50 55 60

His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu Ala
65 70 75 80

Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro Ile Ala Pro
85 90 95

Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr Ser
100 105 110

Thr Leu Gln Glu Gln Ile Gly Trp Met Thr Asn Asn Pro Pro Ile Pro
115 120 125

Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys Ile
130 135 140

Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly Pro
145 150 155 160

Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg
165 170 175

Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr Leu
180 185 190

Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala Leu
195 200 205

Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val
210 215 220

Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gln
225 230 235 240

Val Thr Asn Ser Ala Thr Ile Met Met Gln Arg Gly Asn Phe Arg Asn
245 250 255

Gln Arg Lys Ile Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Thr
260 265 270

Ala Arg Lys
275

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 4..946



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

AAC CAA TGG CCA TTG ACA GAA GAA AAA ATA AAA GCA TTA GTA GAA ATT 48
Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile
1 5 10 15

TGT ACA GAG ATG GAA AAG GAA GGG AAA ATT TCA AAA ATT GGG CCT GAA 96
Cys Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu
20 25 30

AAT CCA TAC AAT ACT CCA GTA TTT GCC ATA AAG AAA AAA GAC AGT ACT 144
Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr
35 40 45

AAA TGG AGA AAA TTA GTA GAT TTC AGA GAA CTT AAT AAG AGA ACT CAA 192
Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln
50 55 60

GAC TTC TGG GAA GTT CAA TTA GGA ATA CCA CAT CCC GCA GGG TTA AAA 240
Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys
65 70 75

AAG AAA AAA TCA GTA ACA GTA CTG GAT GTG GGT GAT GCA TAT TTT TCA 288
Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser
80 85 90 95

GTT CCC TTA GAT GAA GAC TTC AGG AAG TAT ACT GCA TTT ACC ATA CCT 336
Val Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro
100 105 110

AGT ATA AAC AAT GAG ACA CCA GGG ATT AGA TAT CAG TAC AAT GTG CTT 384
Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu
115 120 125

CCA CAG GGA TGG AAA GGA TCA CCA GCA ATA TTC CAA AGT AGC ATG ACA 432
Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Ser Ser Met Thr
130 135 140

AAA ATC TTA GAG CCT TTT AGA AAA CAA AAT CCA GAC ATA GTT ATC TAT 480
Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr
145 150 155

CAA TAC ATG GAT GAT TTG TAT GTA GGA TCT GAC TTA GAA ATA GGG CAG 528
Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln
160 165 170 175

CAT AGA ACA AAA ATA GAG GAG CTG AGA CAA CAT CTG TTG AGG TGG GGA 576
His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly
180 185 190

CTT ACC ACA CCA GAC AAA AAA CAT CAG AAA GAA CCT CCA TTC CTT TGG 624
Leu Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp
195 200 205

ATG GGT TAT GAA CTC CAT CCT GAT AAA TGG ACA GTA CAG CCT ATA GTG 672
Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val
210 215 220

CTG CCA GAA AAA GAC AGC TGG ACT GTC AAT GAC ATA CAG AAG TTA GTG 720
Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val
225 230 235

GGG AAA TTG AAT TGG GCA AGT CAG ATT TAC CCA GGG ATT AAA GTA AGG 768
Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Pro Gly Ile Lys Val Arg
240 245 250 255

CAA TTA TGT AAA CTC CTT AGA GGA ACC AAA GCA CTA ACA GAA GTA ATA 816
Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Ile
260 265 270

CCA CTA ACA GAA GAA GCA GAG CTA GAA CTG GCA GAA AAC AGA GAG ATT 864
Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile
275 280 285

CTA AAA GAA CCA GTA CAT GGA GTG TAT TAT GAC CCA TCA AAA GAC TTA 912
Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu
290 295 300

ATA GCA GAA ATA CAG AAG CAG GGG CAA GGC CTCGAG 948
Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly
305 310

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 314 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile Cys
1 5 10 15

Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu Asn
20 25 30

Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr Lys
35 40 45

Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln Asp
50 55 60

Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys Lys
65 70 75 80

Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val
85 90 95

Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro Ser
100 105 110

Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu Pro
115 120 125

Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Ser Ser Met Thr Lys
130 135 140

Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr Gln
145 150 155 160

Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln His
165 170 175

Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly Leu
180 185 190

Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp Met
195 200 205

Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val Leu
210 215 220

Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val Gly
225 230 235 240

Lys Leu Asn Trp Ala Ser Gln Ile Tyr Pro Gly Ile Lys Val Arg Gln
245 250 255

Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Ile Pro
260 265 270

Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile Leu
275 280 285

Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu Ile
290 295 300

Ala Glu Ile Gln Lys Gln Gly Gln Gly Leu
305 310

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1568 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 7..1565

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GGGGCC TGT CCA AAG GTA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT 48
Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys
1 5 10

GCC CCG GCT GGT TTT GCG ATT CTA AAA TGT AAT AAT AAG ACG TTC AAT 96
Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn
15 20 25 30

GGA ACA GGA CCA TGT ACA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA 144
Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly
35 40 45

ATT AGG CCA GTA GTA TCA ACT CAA CTG CTG TTA AAT GGC AGT CTA GCA 192
Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala
50 55 60

GAA GAA GAG GTA GTA ATT AGA TCT GTC AAT TTC ACG GAC AAT GCT AAA 240
Glu Glu Glu Val Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys
65 70 75

ACC ATA ATA GTA CAG CTG AAC ACA TCT GTA GAA ATT AAT TGT ACA AGA 288
Thr Ile Ile Val Gln Leu Asn Thr Ser Val Glu Ile Asn Cys Thr Arg
80 85 90

CCC AAC AAC AAT ACA AGA AAA AGA ATC CGT ATC CAG AGA GGA CCA GGG 336
Pro Asn Asn Asn Thr Arg Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly
95 100 105 110

AGA GCA TTT GTT ACA ATA GGA AAA ATA GGA AAT ATG AGA CAA GCA CAT 384
Arg Ala Phe Val Thr Ile Gly Lys Ile Gly Asn Met Arg Gln Ala His
115 120 125

TGT AAC ATT AGT AGA GCA AAA TGG AAT AAC ACT TTA AAA CAG ATA GAT 432
Cys Asn Ile Ser Arg Ala Lys Trp Asn Asn Thr Leu Lys Gln Ile Asp
130 135 140

AGC AAA TTA AGA GAA CAA TTC GGA AAT AAT AAA ACA ATA ATC TTT AAG 480
Ser Lys Leu Arg Glu Gln Phe Gly Asn Asn Lys Thr Ile Ile Phe Lys
145 150 155

CAA TCC TCA GGA GGG GAC CCA GAA ATT GTA ACG CAC AGT TTT AAT TGT 528
Gln Ser Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys
160 165 170

GGA GGG GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT AAT AGT ACT 576
Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr
175 180 185 190

TGG TTT AAT AGT ACT TGG AGT ACT GAA GGG TCA AAT AAC ACT GAA GGA 624
Trp Phe Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly
195 200 205

AGT GAC ACA ATC ACC CTC CCA TGC AGA ATA AAA CAA ATT ATA AAC ATG 672
Ser Asp Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met
210 215 220

TGG CAG AAA GTA GGA AAA GCA ATG TAT GCC CCT CCC ATC AGT GGA CAA 720
Trp Gln Lys Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln
225 230 235

ATT AGA TGT TCA TCA AAT ATT ACA GGG CTG CTA TTA ACA AGA GAT GGT 768
Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly
240 245 250

GGT AAT AGC AAC AAT GAG TCC GAG ATC TTC AGA CTT GGA GGA GGA GAT 816
Gly Asn Ser Asn Asn Glu Ser Glu Ile Phe Arg Leu Gly Gly Gly Asp
255 260 265 270

ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA 864
Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys
275 280 285

ATT GAA CCA TTA GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG 912
Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val
290 295 300

CAG AGA GAA AAA AGA GCA GTG GGA ATA GGA GCT TTG TTC CTT GGG TTC 960
Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Leu Phe Leu Gly Phe
305 310 315

TTG GGA GCA GCA GGA AGC ACT ATG GGC GCA GCC TCA ATG ACG CTG ACG 1008
Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr
320 325 330

GTA CAG GCC AGA CAA TTA TTG TCT GGT ATA GTG CAG CAG CAG AAC AAT 1056
Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn
335 340 345 350

TTG CTG AGG GCT ATT GAG GCG CAA CAG CAT CTG TTG CAA CTC ACA GTC 1104
Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val
355 360 365

TGG GGC ATC AAG CAG CTC CAA GCA AGA ATC CTA GCT GTG GAA AGA TAC 1152
Trp Gly Ile Lys Gln Leu Gln Ala Arg Ile Leu Ala Val Glu Arg Tyr
370 375 380

CTA AAG GAT CAA CAG CTC CTA GGG ATT TGG GGT TGC TCT GGA AAA CTC 1200
Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu
385 390 395

ATT TGC ACC ACT GCT GTG CCT TGG AAT GCT AGT TGG AGT AAT AAA TCT 1248
Ile Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser
400 405 410

CTG GAA CAG ATC TGG AAT CAC ACG ACC TGG ATG GAG TGG GAC AGA GAA 1296
Leu Glu Gln Ile Trp Asn His Thr Thr Trp Met Glu Trp Asp Arg Glu
415 420 425 430

ATT AAC AAT TAC ACA AGC TTA ATA CAC TCC TTA ATT GAA GAA TCG CAA 1344
Ile Asn Asn Tyr Thr Ser Leu Ile His Ser Leu Ile Glu Glu Ser Gln
435 440 445

AAC CAG CAA GAA AAG AAT GAA CAA GAA TTA TTG GAA TTA GAT AAA TGG 1392
Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp
450 455 460

GCA AGT TTG TGG AAT TGG TTT AAC ATA ACA AAT TGG CTG TGG TAT ATA 1440
Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile
465 470 475

AAA TTA TTC ATA ATG ATA GTA GGA GGC TTG GTA GGT TTA AGA ATA GTT 1488
Lys Leu Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Ile Val
480 485 490

TTT GCT GTA CTT TCT ATA GTG AAT AGA GTT AGG CAG GGA TAT TCA CCA 1536
Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro
495 500 505 510

TTA TCG TTT CAG ACC CAC CTC CCA ATC TCGAG 1568
Leu Ser Phe Gln Thr His Leu Pro Ile
515

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 519 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Cys Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro
1 5 10 15

Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr
20 25 30

Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg
35 40 45

Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu
50 55 60

Glu Val Val Ile Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Thr Ile
65 70 75 80

Ile Val Gln Leu Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Asn
85 90 95

Asn Asn Thr Arg Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala
100 105 110

Phe Val Thr Ile Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn
115 120 125

Ile Ser Arg Ala Lys Trp Asn Asn Thr Leu Lys Gln Ile Asp Ser Lys
130 135 140

Leu Arg Glu Gln Phe Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser
145 150 155 160

Ser Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly
165 170 175

Glu Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe
180 185 190

Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp
195 200 205

Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln
210 215 220

Lys Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg
225 230 235 240

Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn
245 250 255

Ser Asn Asn Glu Ser Glu Ile Phe Arg Leu Gly Gly Gly Asp Met Arg
260 265 270

Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
275 280 285

Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg
290 295 300

Glu Lys Arg Ala Val Gly Ile Gly Ala Leu Phe Leu Gly Phe Leu Gly
305 310 315 320

Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln
325 330 335

Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu
340 345 350

Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly
355 360 365

Ile Lys Gln Leu Gln Ala Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys
370 375 380

Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys
385 390 395 400

Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu
405 410 415

Gln Ile Trp Asn His Thr Thr Trp Met Glu Trp Asp Arg Glu Ile Asn
420 425 430

Asn Tyr Thr Ser Leu Ile His Ser Leu Ile Glu Glu Ser Gln Asn Gln
435 440 445

Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser
450 455 460

Leu Trp Asn Trp Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Leu
465 470 475 480

Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Ile Val Phe Ala
485 490 495

Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser
500 505 510

Phe Gln Thr His Leu Pro Ile
515

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

CACCCCTCTC CTACGTAACC AAGGATC

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
GTACTGGTCA CCATATTGGT CAAC
(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GGAGAGAGAT GGGAGCTCGA GCGTC
(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GCCCCCCTAT ACGTATTGTG
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCAGTGAATT CCTAATACGA CTCACTATAG GTTAAAAGP G 0 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
CTCTATCCTG AGCTCCATAT GTGTCGAGCA GTTTTTGGTT TAGCATTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Thr Lys Asp Leu Thr Thr Tyr Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2220 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..2203

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

CGA CCA GCA GAC GAG ACA GTC ACA GCA GCC TTG ACA AAA CGT TCC TGG 48
Arg Pro Ala Asp Gln Thr Val Thr Ala Ala Leu Thr Lys Arg Ser Trp
1 5 10 15

AAC TCA AGC ACT TCT CCA CAG AGG AGG ACA GAG CAG ACA GCA GAG ACC 96
Asn Ser Ser Thr Ser Pro Gln Arg Arg Thr Glu Gln Thr Ala Glu Thr
20 25 30

ATG GAG TCT CCC TCG GCC CCT CCC CAC AGA TGG TGC ATC CCC TGG CAG 144
Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro Trp Gln
35 40 45

AGG CTC CTG CTC ACA GCC TCA CTT CTA ACC TTC TGG AAC CCG CCC ACC 192
Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr
50 55 60

ACT GCC AAG CTC ACT ATT GAA TCC ACG CCG TTC AAT GTC GCA GAG GGG 240
Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly
65 70 75 80

AAG GAG GTG CTT CTA CTT GTC CAC AAT CTG CCC CAG CAT CTT TTT GGC 288
Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe Gly
85 90 95

TAC AGC TGG TAC AAA GGT GAA AGA GTG GAT GGC AAC CGT CAA ATT ATA 336
Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile Ile
100 105 110

GGA TAT GTA ATA GGA ACT CAA CAA GCT ACC CCA GGG CCC GCA TAC AGT 384
Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser
115 120 125

GGT CGA GAG ATA ATA TAC CCC AAT GCA TCC CTG CTG ATC CAG AAC ATC 432
Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Ile
130 135 140

ATC CAG AAT GAC ACA GGA TTC TAC ACC CTA CAC GTC ATA AAG TCA GAT 480
Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser Asp
145 150 155 160

CTT GTG AAT GAA GAA GCA ACT GGC CAG TTC CGG GTA TAC CCG GAG CTG 528
Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu Leu
165 170 175

CCC AAG CCC TCC ATC TCC AGC AAC AAC TCC AAA CCC GTG GAG GAC AAG 576
Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys
180 185 190

GAT GCT GTG GCC TTC ACC TGT GAA CCT GAG ACT CAG GAC GCA ACC TAC 624
Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gln Asp Ala Thr Tyr
195 200 205

CTG TGG TGG GTA AAC AAT CAG AGC CTC CCG GTC AGT CCC AGG CTG CAG 672
Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu Gln
210 215 220

CTG TCC AAT GGC AAC AGG ACC CTC ACT CTA TTC AAT GTC ACA AGA AAT 720
Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn
225 230 235 240

GAC ACA GCA AGC TAG AAA TGT GAA ACC CAG AAC CCA GTG AGT GCC AGG 768
Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gln Asn Pro Val Ser Ala Arg
245 250 255

CGC AGT GAT TCA GTC ATC CTG AAT GTC CTC TAT GGC CCG GAT GCC CCC 816
Arg Ser Asp Ser Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro
260 265 270

ACC ATT TCC CCT CTA AAC ACA TCT TAC AGA TCA GGG GAA AAT CTG AAC 864
Thr Ile Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn
275 280 285

CTC TCC TGC CAT GCA GCC TCT AAC CCA CCT GCA CAG TAC TCT TGG TTT 912
Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp Phe
290 295 300

GTC AAT GGG ACT TTC CAG CAA TCC ACC CAA GAG CTC TTT ATC CCC AAC 960
Val Asn Gly Thr Phe Gln Gln Ser Thr Gln Glu Leu Phe Ile Pro Asn
305 310 315 320

ATC ACT GTG AAT AAT AGT GGA TCC TAT ACG TGC CAA GCC CAT AAC TCA 1008
Ile Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gln Ala His Asn Ser
325 330 335

GAC ACT GGC CTC AAT AGG ACC ACA GTC ACG ACG ATC ACA GTC TAT GCA 1056
Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr Ile Thr Val Tyr Ala
340 345 350

GAG CCA CCC AAA CCC TTC ATC ACC AGC AAC AAC TCC AAC CCC GTG GAG 1104
Glu Pro Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
355 360 365

GAT GAG GAT GCT GTA GCC TTA ACC TGT GAA CCT GAG ATT CAG AAC ACA 1152
Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
370 375 380

ACC TAC CTG TGG TGG GTA AAT AAT CAG AGC CTC CCG GTC AGT CCC AGG 1200
Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
385 390 395 400

CTG CAG CTG TCC AAT GAC AAC AGG ACC CTC ACT CTA CTC AGT GTC ACA 1248
Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
405 410 415

AGG AAT GAT GTA GGA CCC TAT GAG TGT GGA ATC CAG AAC GAA TTA AGT 1296
Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Glu Leu Ser
420 425 430

GTT GAC CAC AGC GAC CCA GTC ATC CTG AAT GTC CTC TAT GGC CCA GAC 1344
Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
435 440 445

GAC CCC ACC ATT TCC CCC TCA TAC ACC TAT TAC CGT CCA GGG GTG AAC 1392
Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
450 455 460

CTC AGC CTC TCC TGC CAT GCA GCC TCT AAC CCA CCT GCA CAG TAT TCT 1440
Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
465 470 475 480

TGG CTG ATT GAT GGG AAC ATC CAG CAA CAC ACA CAA GAG CTC TTT ATC 1488
Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
485 490 495

TCC AAC ATC ACT GAG AAG AAC AGC GGA CTC TAT ACC TGC CAG GCC AAT 1536
Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
500 505 510

AAC TCA GCC AGT GGC CAC AGC AGG ACT ACA GTC AAG ACA ATC ACA GTC 1584
Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
515 520 525

TCT GCG GAG CTG CCC AAG CCC TCC ATC TCC AGC AAC AAC TCC AAA CCC 1632
Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
530 535 540

GTG GAG GAC AAG GAT GCT GTG GCC TTC ACC TGT GAA CCT GAG GCT CAG 1680
Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
545 550 555 560

AAC ACA ACC TAC CTG TGG TGG GTA AAT GGT CAG AGC CTC CCA GTC AGT 1728
Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
565 570 575

CCC AGG CTG CAG CTG TCC AAT GGC AAC AGG ACC CTC ACT CTA TTC AAT 1776
Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
580 585 590

GTC ACA AGA AAT GAC GCA AGA GCC TAT GTA TGT GGA ATC CAG AAC TCA 1824
Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
595 600 605

GTG AGT GCA AAC CGC AGT GAC CCA GTC ACC CTG GAT GTC CTC TAT GGG 1872
Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
610 615 620

CCG GAC ACC CCC ATC ATT TCC CCC CCA GAC TCG TCT TAC CTT TCG GGA 1920
Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
625 630 635 640

GCG AAC CTC AAC CTC TCC TGC CAC TCG GCC TCT AAC CCA TCC CCG CAG 1968
Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
645 650 655

TAT TCT TGG CGT ATC AAT GGG ATA CCG CAG CAA CAC ACA CAA GTT CTC 2016
Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
660 665 670

TTT ATC GCC AAA ATC ACG CCA AAT AAT AAC GGG ACC TAT GCC TGT TTT 2064
Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
675 680 685

GTC TCT AAC TTG GCT ACT GGC CGC AAT AAT TCC ATA GTC AAG AGC ATC 2112
Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
690 695 700

ACA GTC TCT GCA TCT GGA ACT TCT CCT GGT CTC TCA GCT GGG GCC ACT 2160
Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
705 710 715 720

GTC GGC ATC ATG ATT GGA GTG CTG GTT GGG GTT GCT CTG ATA 2202
Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile
725 730

TAGCAGCCCT GGTGTAGT 2220



(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 734 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Arg Pro Ala Asp Gln Thr Val Thr Ala Ala Leu Thr Lys Arg Ser Trp
1 5 10 15

Asn Ser Ser Thr Ser Pro Gln Arg Arg Thr Glu Gln Thr Ala Glu Thr
20 25 30

Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro Trp Gln
35 40 45

Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr
50 55 60

Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly
65 70 75 80

Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe Gly
85 90 95

Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile Ile
100 105 110

Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser
115 120 125

Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Ile
130 135 140

Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser Asp
145 150 155 160

Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Pro Glu Leu
165 170 175

Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro Val Glu Asp Lys
180 185 190

Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Thr Gln Asp Ala Thr Tyr
195 200 205

Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg Leu Gln
210 215 220

Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn Val Thr Arg Asn
225 230 235 240

Asp Thr Ala Ser Tyr Lys Cys Glu Thr Gln Asn Pro Val Ser Ala Arg
245 250 255

Arg Ser Asp Ser Val Ile Leu Asn Val Leu Tyr Gly Pro Asp Ala Pro
260 265 270

Thr Ile Ser Pro Leu Asn Thr Ser Tyr Arg Ser Gly Glu Asn Leu Asn
275 280 285

Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser Trp Phe
290 295 300

Val Asn Gly Thr Phe Gln Gln Ser Thr Gln Glu Leu Phe Ile Pro Asn
305 310 315 320

Ile Thr Val Asn Asn Ser Gly Ser Tyr Thr Cys Gln Ala His Asn Ser
325 330 335

Asp Thr Gly Leu Asn Arg Thr Thr Val Thr Thr Ile Thr Val Tyr Ala
340 345 350

Glu Pro Pro Lys Pro Phe Ile Thr Ser Asn Asn Ser Asn Pro Val Glu
355 360 365

Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu Ile Gln Asn Thr
370 375 380

Thr Tyr Leu Trp Trp Val Asn Asn Gln Ser Leu Pro Val Ser Pro Arg
385 390 395 400

Leu Gln Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr
405 410 415

Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly Ile Gln Asn Glu Leu Ser
420 425 430

Val Asp His Ser Asp Pro Val Ile Leu Asn Val Leu Tyr Gly Pro Asp
435 440 445

Asp Pro Thr Ile Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn
450 455 460

Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gln Tyr Ser
465 470 475 480

Trp Leu Ile Asp Gly Asn Ile Gln Gln His Thr Gln Glu Leu Phe Ile
485 490 495

Ser Asn Ile Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gln Ala Asn
500 505 510

Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr Ile Thr Val
515 520 525

Ser Ala Glu Leu Pro Lys Pro Ser Ile Ser Ser Asn Asn Ser Lys Pro
530 535 540

Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gln
545 550 555 560

Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gln Ser Leu Pro Val Ser
565 570 575

Pro Arg Leu Gln Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn
580 585 590

Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly Ile Gln Asn Ser
595 600 605

Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly
610 615 620

Pro Asp Thr Pro Ile Ile Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly
625 630 635 640

Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gln
645 650 655

Tyr Ser Trp Arg Ile Asn Gly Ile Pro Gln Gln His Thr Gln Val Leu
660 665 670

Phe Ile Ala Lys Ile Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe
675 680 685

Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser Ile Val Lys Ser Ile
690 695 700

Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr
705 710 715 720

Val Gly Ile Met Ile Gly Val Leu Val Gly Val Ala Leu Ile
725 730

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 41 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CCAGTGAATT CCTAATACGA CTACCTATAG GTTAAAACAG
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GATGAACCCT CGAGACCCAT TATG
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
CCACCAAGTA CGTAACCACA TATGG
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTGAGGACTG CTGG

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CACCACTGCC CTCGAGAAGC TCACTATTG
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
CACCACTGCC CTCGAGAAGC TCACTATTG